Multi-Stream Deep Similarity Learning Networks for Visual Tracking
نویسندگان
چکیده
Visual tracking has achieved remarkable success in recent decades, but it remains a challenging problem due to appearance variations over time and complex cluttered background. In this paper, we adopt a tracking-by-verification scheme to overcome these challenges by determining the patch in the subsequent frame that is most similar to the target template and distinctive to the background context. A multi-stream deep similarity learning network is proposed to learn the similarity comparison model. The loss function of our network encourages the distance between a positive patch in the search region and the target template to be smaller than that between positive patch and the background patches. Within the learned feature space, even if the distance between positive patches becomes large caused by the appearance change or interference of background clutter, our method can use the relative distance to distinguish the target robustly. Besides, the learned model is directly used for tracking with no need of model updating, parameter fine-tuning and can run at 45 fps on a single GPU. Our tracker achieves state-of-the-art performance on the visual tracking benchmark compared with other recent real-time-speed trackers, and shows better capability in handling background clutter, occlusion and appearance change.
منابع مشابه
Deep Tracking: Visual Tracking Using Deep Convolutional Networks
In this paper, we study discriminatively trained deep convolutional networks for the task of visual tracking. Our tracker utilizes both motion and appearance features extracted from a pre-trained dual stream deep convolution network. By using optical flow and deep networks to implement a dual appearance and motion stream to inform tracking, our tracker outperforms current state of the art track...
متن کاملTracking of Humans in Video Stream Using LSTM Recurrent Neural Network
In this master thesis, the problem of tracking humans in video streams by using Deep Learning is examined. We use spatially supervised recurrent convolutional neural networks for visual human tracking. In this method, the recurrent convolutional network uses both the history of locations and the visual features from the deep neural networks. This method is used for tracking, based on the detect...
متن کاملA multi-scale convolutional neural network for automatic cloud and cloud shadow detection from Gaofen-1 images
The reconstruction of the information contaminated by cloud and cloud shadow is an important step in pre-processing of high-resolution satellite images. The cloud and cloud shadow automatic segmentation could be the first step in the process of reconstructing the information contaminated by cloud and cloud shadow. This stage is a remarkable challenge due to the relatively inefficient performanc...
متن کاملDeep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks
This paper discusses the problem of tracking from a deep learning approach. This experiment takes cues from how the brain is modeled to create deep convolutional networks that mimic how the human brain tracks objects. By using optical flow and deep networks to implement a dual appearance and motion stream, our tracker outperforms current state of the art methods.
متن کاملLearning Dual Multi-Scale Manifold Ranking for Semantic Segmentation of High-Resolution Images
Semantic image segmentation has recently witnessed considerable progress by training deep convolutional neural networks (CNNs). The core issue of this technique is the limited capacity of CNNs to depict visual objects. Existing approaches tend to utilize approximate inference in a discrete domain or additional aides and do not have a global optimum guarantee. We propose the use of the multi-lab...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017